Toward the Principled Utilization of Contextual Evidence within the Information Retrieval Process: Investigating the theoretical foundation of Language Model Based Information Retrieval

Authors

  • Leif Azzopardi
  • Mark Girolami
Abstract

The language modelling approach to Information Retrieval (IR) has generated much interest in the field since its conception in 1998[73]. However, serious questions have been raised about the integrity of the approach. Specifically, it does not model relevance explicitly, unlike traditional probabilistic models of IR such as the Binary Independence Model[93]. Instead, it relies on several underlying assumptions that are touted as correlating with relevance. In this document, we review current state-of-the-art language modelling approaches to IR and discuss the conjecture surrounding them. We then present a study that analyzes the relationship between perplexity and Average Precision that underpins the language modelling approach. We conclude by detailing some potential future directions for the Ph.D., the expected contributions of the work and the proposed timetable.


Similar resources

Analysis of the Therapists’ Information Behavior in the diagnosis and treatment of mental disorders based on Kuhlthau's information retrieval process model

Background and Aim: Under the influence of various factors, people use different methods to obtain information and exhibit different information behaviors. In recent decades, information science experts have described these behaviors as patterns and models of information retrieval, which can be used in various fields. One of these areas that almost all people are...


Investigating the Impact of Authors’ Rank in Bibliographic Networks on Expertise Retrieval

Background and Aim: This research investigates the impact of authors’ rank in bibliographic networks on the document-centered model of Expertise Retrieval. Its purpose is to find out what kind of author ranking in bibliographic networks can improve the performance of the document-centered model. Methodology: The current research is experimental. To operationalize research goals, a new test colle...


Investigating the Effects of Stemming on Information Retrieval in the Persian Language

Exploiting language-specific behavior in information retrieval systems can significantly improve the quality of the retrieved results. The part of a word that remains after its affixes are removed is called the stem. Stemming can be used to improve the relevance of the results in an information retrieval system. Different morphological variants of words (plural, past tense…) will be mapped into t...


Improved Skips for Faster Postings List Intersection

Information retrieval can be achieved through computerized processes by generating a list of relevant responses to a query. The document processor, matching function and query analyzer are the main components of an information retrieval system. Document retrieval systems are fundamentally based on Boolean, vector-space, probabilistic, and language models. In this paper, a new methodology for mat...



Publication date: 2003